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Abstract 

In this paper we propose a new test for the hypothesis of a constant coefficient of variation 
in the common nonparametric regression model. The test is based on an estimate of the L 2 - 
distance between the square of the regression function and variance function. We prove 
asymptotic normality of a standardized estimate of this distance under the null hypothesis 
and fixed alternatives and the finite sample properties of a corresponding bootstrap test 
are investigated by means of a simulation study. The results are applicable to stationary 
processes with the common mixing conditions and are used to construct tests for ARCH 
assumptions in financial time series. 

Keywords and Phrases: stationary processes, nonparametric regression, constant coefficient of 
variation, multiplicative error structure, generalized nonparametric regression models. 

1 Introduction 

We consider the common nonparametric regression model 

(1.1) Y t = m(Xi) +a(X t )e l7 i = 1, 2, . . . ,n, 
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where m denotes the regression function and a 2 the variance function and the random variables e$ 
satisfy E[si\Xi = x] = and E[ef\Xi = x] = 1. In many applications the variance can be assumed 
proportional to the squared mean which corresponds to the assumption of a constant coefficient 
of variation. Typical examples include models obtained by the logarithmic transformation from 
regression models with a multiplicative error structure [see Eagleson and Miiller (1997)] or ARCH- 
type models [see Engle (1982)]. Several authors have discussed the problem of estimating and 
testing the regression function under the restriction that m and a are proportional - see e.g. Mc 
Cullagh and Nelder (1989), who considered generalized linear models, Carroll and Ruppert (1988), 
who considered a constant coefficient of variation with a parametric model, and Eagleson and 
Miiller (1997), who investigated the common nonparametric regression model under the restriction 
that m = co for some constant c. 

In the present paper we will develop a formal test for the hypothesis of a constant coefficient of 
variation in the nonparametric regression model (11. ip . that is 

(1.2) H : m(x) = ca{x) 

for some positive (but unknown constant) c. Besides the fact that this test can be used to check 
the assumptions for a statistical inference in a nonparametric regression model with a constant 
coefficient of variation, it can also be used as an indicator of a multiplicative error structure 
(if it is applied to the squares of the data) and an exponentially distributed response Y where 
E[Y\X = x} = ^Var[F|X — x] = m(x). In Section 2 we introduce the test statistic and indicate 
possible applications. Section 3 contains our main results in the case of an i.i.d. sample {X iy Yi}f =1 . 
We prove asymptotic normality of a standardized version of the test statistic under the null 
hypothesis and fixed alternatives. In Section 4 we extend these results in the case of stationary 
time series with the common mixing properties and discuss an application to test for an ARCH(l) 
model. The finite sample properties of a bootstrap version of the new test are investigated in 
Section 5 and some of the technical details for the proofs of our main results are presented in the 
Appendix in Section 6. 

2 Testing for a constant coefficient of variation in non- 
parametric regression 

Numerous authors have considered testing various hypotheses regarding the mean and the variance 
function in the nonparametric regression model (11.11) [see e.g. Dette and Munk (2003) and the 
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references in this paper]. These hypotheses include parametric and semi parametric assumptions 
regarding the mean and variance function, but much less effort has been spent in investigating the 
relation between mean and variance in the nonparametric regression model (11.11) . In the present 
paper we investigate the hypothesis (11. 21) of a constant coefficient of variation using an estimate of 
the L 2 -distance between the variance and squared regression function. Typical examples include 
multiplicative models of the form 

Y t = m(X t )r] t 

which can be written in the form (11.1 p with cr(-) = ^JVar (jjt)m(-) and e t = {f]t — 1)/ A/Var(r^). 
Other examples include nonparametric ARCH models X t = m(X 2 _ 1 )r] t , for which the squared 
process corresponds to a multiplicative times series model. 

To be precise let {X^Yj}™^ denote a bivariate sample of observations from the nonparametric 
regression model (II. ip with the same distribution and let m and a 2 denote two nonparametric 
estimates of the regression and variance function, respectively, which will be specified in the 
following section. For any positive c we define the statistic T n (c) as 

(2.1) T n (c) = - r ^Y j K g {X i -X j ){c 2 Yl-{c 2 + l)m 2 {X i )}w{X i ) 

n(n — 1) ^-^ 

x {c 2 Y 2 - (c 2 + l)m 2 (X J )}w(X J ), 

where w denotes a weight function, K g (-) = ^K(-/g), K(-) denotes a kernel and g is a bandwidth 
converging to with increasing sample size. Note that the statistic of the form (12.11) has been 
considered before by Zheng (1996) for testing the parametric form of the regression function, 
by Dette (2002) for testing homoscedasticity, by Dette and von Lieres und Wilkau (2003) and 
Gozalo and Linton (2000) for testing additivity in a nonparametric regression model (II. ip with a 
multivariate predictor. If the estimate rh is consistent it is intuitively clear that for a large sample 
size 

E[T n (c)} « E[K g {Xt - X 2 ){c 2 a 2 (X 1 )e 2 1 - 2c 2 m(X 1 )a(X 1 )e 1 - m 2 ^)} 
x {c 2 a 2 (X 2 )e 2 2 - 2c 2 m{X 2 ) ( j(X 2 )e 2 - m 2 (X 2 )}} 
« ^[/(X.HcVTO - m 2 (X l )} 2 ^ 2 (X J )] 

(2.2) = E[A 2 (X l )/(X J ) W 2 (X l )], 

where / denotes the density of X and 

(2.3) A c (x) = m 2 (x) - c 2 a 2 (x). 
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Note that E[A 2 c (X i )f(X i )w 2 (X i )) = if and only if the null hypothesis (TOD is satisfied. There 
exist a few cases, where the constant c in the statistic T n (c) is known. For example in ARCH(l) 
models with standard normal distributed innovations r\ t we have X 2 = a + a\Xf_ x + (a + 
aiX 2 _ 1 )(rj 2 — 1), which gives c = I/a/2. However, in most cases of practical interest the constant 
c has to be estimated from the data. For this purpose we consider the least squares problem 

(2.4) c 2 = arg min - c 2 a 2 (X t )) 2 W ( Xl ) = ^^^^^ 

and estimate the unknown quantities on the right hand side. We define the residuals 

(2.5) r{X i ) = Y i -m{X i ), (i = l,...,n) 
and the estimate 

r9 fi , . 2 = (i/»)E: =1 m 2 (x t )f 2 (x>(x,) 

1 J ' ~ (i/n)Er =1 (^M)M^) ' 

where the variance estimate a is defined in ( 13.31) . Note that the squared residuals f 2 (-) are used 
for estimating the variance function in the numerator of c 2 in order to avoid an additional bias 
caused by the use of the variance estimator <r 2 (-) [see the proof of Theorem 3.2 in the Appendix]. 

It is intuitively clear that the expression c 2 estimates 



(2.7) c; 



, E[m 2 (X)a 2 (X)w(X)] 
" ^[^(X) W (X)] ' 



which coincides with the constant c 2 if the null hypothesis ( 11. 21) is satisfied and corresponds to 
the best L 2 -approximation of m 2 by functions of the form c 2 a 2 , otherwise. Consequently the 
hypothesis of a constant coefficient of variation will be rejected for large values of the statistic 
T n (c). 

In the following sections we specify the asymptotic properties of the statistics T n (c), c 2 and T n (c) if 
the local linear estimate [see Fan and Gijbels (1996)] is used for estimating the mean and variance 
function. 



3 Main results 

In order to state our main results we have to specify nonparametric estimates of the regression 
and variance function and several assumptions for the model ( 11. II) . We begin with the definition 
of the estimates. For the regression function we use the local linear estimate [see Fan and Gijbels 
(1996)] _^ 

(a i \ " ( ) - S=l Kh ^ Xi ~ g) [ g "- 2 ( X ) ~ ( X Z X i) S n,l(x)] Yj 

[ ' J m[X) ~ EILi Kh{Xi - x) [s n , 2 (x) -{x- X^s^x)} 
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where Kh(-) = j-K(-/h), K(-) is a kernel, h denotes a further bandwidth and 

n 

(3.2) s rhl (x)=Y l K h{X i -x)(x-X i ) 1 1 = 1,2. 



Similarly, the estimate of the variance function is obtained by replacing the observations Y{ by the 
squared residuals r 2 (Xj) defined in ( 12. 51) and is given by 



(3.3) a\x 



2, _ Eti K h(X i -x)[s n>2 (x)-(x-X i )s n , 1 (x)}f 2 (X i ) 



Yh=\ K h(Xi - X) [s n ,2(x) - (X - Xi)Sn :1 (x)} 

For the sake of transparency we first assume that {Xj, Vj}" =1 is a sample of independent identically 
distributed observations. A corresponding result in the time series context is given in the following 
section. Moreover, the same bandwidths are assumed for the calculation of the estimates of the 
regression and variance function for the sake of simple notation. The treatment of different 
bandwidths in these estimates does not cause additional difficulties (and in the simulation study 
presented in Section 5 we used in fact different bandwidths). Throughout this section we assume 
that the following assumptions are satisfied 

(Al) The density / is twice continuously differentiable on compact sets. 

(A2) The regression function m is four times continuously differentiable on compact sets. 

(A3) The variance function a 2 is positive and twice continuously differentiable on compact sets. 

(A4) The weight function w is twice continuously differentiable and has compact support con- 
tained in {x\f(x) > 0}. 

(A5) The kernel K is of order 2, and satisfies a Lipschitz condition. 

(A6) If n —>■ oo the bandwidth g and h satisfy 

(3.4) h ~ n~ 1/5 , g = o{h 2 ), ng -> oo. 

(A7) The function m^(x) = E[e \X = x] is continuous for k = 3,4 and for 1 < k < 8 uniformly 
bounded, that is 

(3.5) E[e k t \X t = x] < C < oo, k < 8. 
(A8) The regression and variance function satisfy 

E[m(X)] k < oo for k = 2, 4, and E[a 2 (X)] k < oo for k = 1, 2. 
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Our first result specifies the asymptotic distribution of the statistic T n (c), where the constant c 
in the hypothesis (11.21) is known. Roughly speaking the statistic T n (c) is asymptotically normally 
distributed with different rates of convergence under the null hypothesis and alternative. The 
proof is complicated and therefore deferred to the Appendix. 

Theorem 3.1. Assume that the assumptions (Al) - (A7) are satisfied. 

(a) Under the null hypothesis hi. 2) we have 

(3.6) n^T n (c)^M(0^l), 
where the asymptotic variance is given by 

(3.7) fj% = 2 E[{-1 + Ac 2 + 4cm 3 (X) + m A (X)} 2 m\X)f(X)w A {X)} J K 2 (u)du. 

(b) Under a fixed alternative Hi : m ^ ca we have 

(3.8) V^(T n (c) -E[T n (c)]) -^jV(0,^(c)), 
where 

(3.9) E[T n (c)} = E[A 2 iX)f(X)w 2 (X)] + h 2 B(c) + o(h 2 ) 
with A c defined in $2.3\) . Ki — J u 2 K{u)du and 

(3.10) B(c) = 2(c 2 + 1) K 2 E[A c (X)m(X)m"(X)f(X)w 2 (X)}. 

The asymptotic variance is given by 

/i?(c) = 4Far(A 2 (X)/(X)t i ; 2 (X)) + 16E[A2(X)m 2 (X)a 2 (X)/ 2 (X)u; 4 (X)] 
+4c 4 J E;[A 2 (X) ( T 4 (X)/ 2 (X){m 4 (X) - l}w\X)} 
-16c 2 E[A 2 c (X)m(X)a 3 (X)f 2 (X)m 3 (X)w 4 (X)}. 

In most applications the value c in the hypothesis (ll.2p is not known and has to be estimated 
from the data. The following results specify the asymptotic properties of the estimate c 2 defined 
in (12. 61) and the test statistic T n (c). 

Theorem 3.2. If the assumptions (Al) - (A8) are satisfied, then 

c 2 -E[c 2 ] = -^{r 1 (m 2 (X J )a 2 (X J V(X i K 2 -E[m 2 (X)a 2 (XV(X)]) 

i=l 

(3.11) + 2r 1 m{X i )a\X i )w{X i )e i - r 2 (V(X>(X) - E[a 4 (X)w(X)} 

- 2r 2 ( T 4 (X) W (X i ){ £ 2 - 1}} + o p f-^J . 



Moreover, 

(3.12) v^(c 2 -£[c 2 ]) ^A/"(0,z/ 2 ), 

(3.13) E[c 2 } = c 2 + h 2 T + o(h 2 ) 
and the constants V,ti, t 2 and v 2 are given by 

T = k 2 E[a 2 (X){r im (X)m"(X)-r 2 (a 2 (X))"}w(X)], 
1 

Tl ~ E[a 4 (X)w(X)Y 

E[m 2 {X)a 2 {X)w(X)} 
T2 ~ E 2 [a\X)w{X)} ' 

v 2 = T 2 Var(m 2 (X)a 2 (X)w(X)) +AT 2 E[m 2 (X)a 6 (X)w 2 (X)} 
+AT 2 E[m 3 (X)a 5 (X)m 3 (X)w 2 (X)} + r 2 Var(<j A (X)w(X)) 

+Ar 2 E[a\X){m 4 (X) - l}w 2 (X)} - 2r 1 r 2 Cov(m 2 (X)a 2 (X)w(X),a 4 (X)w(X)) 
-AT 1 T 2 E[m 2 (X)a 6 (X){m 4 (X) - 1} w 2 (X)} - Ar^ElmiX)^ (X)m 3 (X)w 2 (X)}. 



We are now in a position to investigate weak convergence of the statistic T n (c), where the estimate 
c 2 is defined in f !2.4p . We begin with the asymptotic distribution under the null hypothesis (jl.2p . 
Interestingly, in this case the estimation of the scaling factor c has no influence on the asymptotic 
properties of the test statistic. 

Theorem 3.3. Assume that the assumptions (Al) - (A8) are satisfied. Under the null hypothesis 
JSP we. have 

n^/g T n (c) = ny/g T n (c) + o p (1) JV(0, Mo), 
where the constant fi^ is defined in 7\) . 

Our final result in this section refers to the asymptotic properties of the statistic T n (c) under the 
alternative. In this case there appears an additional term in the bias and variance of the test 
statistic, which is caused by the estimation of the scaling factor c. Recall that the constant Cq 
corresponds to the best L 2 -approximation of m 2 by functions of the form c 2 cr 2 . 

Theorem 3.4. Assume that the assumptions (Al) - (A8) are satisfied. Under a fixed alternative 

g = E[A co (X)a 2 (X)f(X)w 2 (X)]>0 
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we have 

yfr(T n (c)-E[T n (c)])^MM), 

where 

E[T n {c)] = E[A 2 jX)f(X)w 2 (X)} + h 2 (B(co) - 2gT) + o(h 2 ), 
and B{cq) is a term in the bias of the statistic T n (c ). The asymptotic variance <J\ is given by 

uj\ = i4(c ) + 4g 2 u 2 - Aq v 2 (c ), 

where /xf(co) is defined in Theorem 3.1(b), v 2 corresponds to the asymptotic variance of c 2 in 
Theorem 3.2 and 

v 2 (c ) = 2r 1 E[A c SX){m 2 {X)- C y{X)m,{X))m 2 {X)a 2 {X)f{X)w\X)) 
-2r 1 E[A 2 ) (X)/(X) W 2 (X)]E[m 2 (X) ( T 2 (X) W (X)] 
-Ac 2 r l E[A Co (X)m(X)a 5 (X)f(X)m 3 (X)w 3 (X)] 
-2T 2 Cov(A 2 jX)f(X)w 2 (X),a\X)w(X)) 
+Aclr 2 E[A C0 {X)a\X)f\X){m i {X) - l}w\X)} 
+Ar 1 E[A C0 (X)m 3 (X)a 3 (X)f(X)m 3 (X)w 3 (X)] 
+8r 1 E[A CQ (X)m 2 (X)a\X)f(X)w 3 (X)] 
-%T 2 E[A Ca (X)m(X)a\X)fXX)mz(X)w 3 (X)}. 

Remark 3.5. The term v 2 (c ) corresponds to the asymptotic covariance between the statistic 
T n (c ) and the estimate c 2 of c\. 

4 Further discussion 

4.1 Asymptotic results for absolutely regular processes 

The general nonparametric framework includes time series models. Typical examples are multi- 
plicative models Z t = cr t r]t, where at is a positive function of the past {Z t -i : % > 1} and possibly 
of the past volatility {(X t _j : % > 1}. For instance, defining a t by a/^o + ^l-^t-i f° r > we 
achieve the linear ARCH(l) model. Therefore our test can also be used as a preliminary step to 
identify certain time series. For this purpose it is necessary to extend the asymptotic results under 
a more general setup which includes both time series data and i.i.d. observations as special cases. 
For this purpose we need the following assumptions for some fixed e G (0, 1/2) and £ > 2. 
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(Ml) The process (X^ Yi) is absolutely regular, i.e. 

P(k) = snpE[snp{\P(A\rj-P(A)\Ae^r + k}]^0, as k -> 0, 
where T\ is the cx-algebra generated by {(Xi, Yj) : s < I <t}. Further, 

oo 

i=i 

(M2) The innovations e t in the model (11.11) satisfy 

E[e t \X t ,J^(X,Y)} = E[e t \X t ] = 0, 

and 

Var(Y t \X t ,Ft£(X,Y)) = a 2 (x). 
Further, E\e t \ k < oo to the order k < 48£(1 + e). 

(M3) The regression function m(-) satisfies 

E\m(X)\ k < oo for fc < 4(1 + e) and 
E|m"(X)| fc < oo for k < 20^(1 + e), 

whereas the variance function <J 2 (-) fulfills 

^|a 2 (X)| fe < oo for k < 12^(1 + e). 

Note that assumption (M3) contains assumption (A8) which is therefore omitted. Under the as- 
sumptions (Al) - (A7) together with (Ml) - (M3) the asymptotic results for T n (c), c 2 and T n (c) can 
be established for strictly stationary, /^-mixing processes {Aj,Fj}j gZ . The proof of the following 
results is obtained from the proof of the statements presented in Section 3 for the independent case 
using similar arguments as given by Dette and Spreckelsen (2004), where the authors investigate 
the asymptotic distribution of goodness-of-fit tests of linearity for absolutely regular processes. 
For the sake of brevity the details are omitted and we refer the interested reader to the PhD thesis 
of Wieczorek (2007). Moreover, we only state the results for the statistic T„(c). Note that under 
the null hypothesis the asymptotic distribution of T n (c) under mixing assumptions coincides with 
the distribution for the i.i.d. case. 
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Theorem 4.1 Assume that the assumptions (Al) - (A7) and (Ml) - (M3) are satisfied. Under 
the null hypothesis, we have 

where /Xq is the asymptotic variance ofT n (c) defined in flff. 7| ). 



Our final theoretical result states the asymptotic properties of the statistic T„(c) under fixed al- 
ternatives. Note that in this case the variance of the limit distribution contains the variance of 
the limit distribution for the i.i.d. case as well as additional covariances. For a precise statement 
of the result we introduce the notation E® , which denotes the expectation with respect to the 
product measure. 

Theorem 4.2. If the assumptions (Al) - (Al) and (Ml) - (MS) are satisfied, then under a fixed 
alternative g > we have 

vfa (T n (c) - E®[T n (c)}) ^Af(0,ul). 

In particular, 

\E[T n {c)]-E®[T n {c)]\=o f ' 



where the mean E®\T n {c)\ and the constant g are given in Theorem 3.4- The asymptotic variance 
is given by 

(4.1) Cj\ = j2j(c ) +Ag 2 v 2 - Ag £ 2 (c ), 

where Pl\{cq) denotes the asymptotic variance ofT n (co) defined by 



oo 

fil(c ) = /i?(c ) +8j2Cov(A C0 (X l )[A C0 (X 1 ,e 1 ) + 2m(X 1 )a(X 1 )e 1 ]f(X 1 )w 2 (X 1 ), 

t=i 

4( v ,.v ,,,, 



Al(X 1+t )f(X 1+t )w 2 (X 1+t )), 



/Xi(co) is defined in Theorem 3.1(b). The term v 2 in \-j.l\j corresponds to the asymptotic variance 
of the estimate c 2 given by 



~2 2 
V — V 



oo 

+ 2^ Cov(2nm(X 1 )a 3 (X 1 )w(Xi)e 1 - r 2 a\X 1 )w(X 1 ){2e 2 1 - 1} 
t=\ 



+T 1 m 2 (X 1 )cr 2 (X 1 )w(X 1 )ej, T im 2 {X 1+t )a 2 {X l+t )w{X 1+t ) - T 2 a\X 1 )w{X 



Vh 
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where v 2 is given in Theorem 3.2 and v 2 (c ) corresponds to the asymptotic covariance between 
T n (co) and c 2 defined by 

oo 

v 2 (c ) = v 2 (c )+2j2Cov(A CQ (X 1 )f(X 1 ){A Co (V 1 )+2m(X 1 )a(X 1 )e 1 }w 2 (X 1 ), 

t=i 

{r im 2 (X 1+t )a 2 (X 1+t ) - r 2 a\X 1+t )}w(X 1+t )) 

oo 

+ 2j2Cov{At o (X 1+t )f(X 1+t )w 2 (X 1+t ), 
t=i 

{T 1 m 2 (X 1 )a 2 (X 1 )e 2 1 + 2r 1 m(X i y 3 (X 1 ) £l - r 2 a 4 (X 1 )(2^ - l)}w(X 1 )) , 
and v 2 (c ) is defined in Theorem 3.4- 

Remark 4.3. It is worthwhile to mention that in the case where the stationary process is 
absolutely regular with a geometric rate, i.e. /3(j) = 0(p>) for some p e (0,1), the asymptotic 
covariance of the test statistic given in Theorem 4.2 coincides with the asymptotic covariance 
given in Theorem 3.4 for the independent case, that is: 

Mi(c ) = A*i(co) » v 2 = v 2 , z> 2 (c ) = z/ 2 (c ) . 

Remark 4.4. The moment assumption (M3) is quite restrictive and limits the applicability of 
the test to many interesting time series models such as ARCH or GARCH models. One possible 
way to circumvent assumption (M3) is the introduction of an additional weight function in the 
estimates. As a consequence a slight modification of the estimates can be arranged in our testing 
procedure eliminating assumption (M3). The details can be found in Wieczorek (2007), and only 
the modification is mentioned for the sake of brevity. We introduce in a first step an additional 
weight function w*, satisfying 

(A9) w* is twice continuously differentiate and has compact support contained in {x\w(x) > 0}. 
Next, we propose a modified estimate of the regression function given by m*(x) = a, where 

n 

(4.2) (a, b) = argmin V {Yi - a - b(X t - x)} 2 w(Xi)K 

is the local linear estimate (additionally weighted by w) of the regression function and its deriva- 
tive. Note that the modified local linear regression estimator m* differs from the local linear 



Xi 



x 



h 
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estimate rh in (13 .ip by the introduction of the weight function w in (14. 2p . Similarly, we propose 
(a 2 )*(x) = a as the modified estimate of the variance function, where 

(a J) = argmin^ {(f*) 2 (X 4 ) - a - b(X t - x)} 2 w*{X^K 

' i=l ^ ' 

is the local linear estimate (weighted by the second weight function w*) based on the nonparametric 
residuals f*(Xi) defined by 

f*(X i ) = Y l -m*(X i ). 

Based on the modified estimates of the regression function and the variance function the new test 
statistic is defined by 

K{c) = — ^Y, K ^- x ^H c X 2 -(c 2 + i)m 2 (x l )}w*(x l ) 

(4.3) x { C 2 ^ 2 -(c 2 + l)(m*) 2 M}^M- 

In addition, we consider the modified least squares problem 

n 

(c 2 )* = arg min T(m 2 (X t ) - cV^)) 2 ^*) 3 ^). 
ceK>° L — 4 

i=l 

Therefore, we define the estimate of c 2 by 



r2y = (i/^)Er=i(^) 2 (^)r) 2 (^)K) 3 (^) 
{c> (iHEr=i((^)*(^)) 2 (^) 3 ra ' 

As an immediate consequence of the modified definitions the asymptotic results in Theorem 4.1 
and 4.2 can also be established for the test statistic T*(c*). No additional assumptions are needed, 
in particular, the introduction of the weight functions in the estimators avoids the assumption 
(M3) about the boundedness of the moments of the regression and variance function [see Wiec- 
zorek (2007) for details]. This modification makes the test applicable to financial time series, as 
demonstrated in the following section. 

Remark 4.5. An alternative test statistic, which is related closer to the estimation of the coeffi- 
cient of variation m(x)/a(x) was proposed by the referee. This approach is similar to the method 
discussed in Dette and Hetzler (2008) and is currently being investigated by the authors. To be 
precise, define Ui = Yi/&(Xi) (i = 1, . . . , n), U = ^ Y^j=i Uj anc ^ consider the statistic 

H n {x)= [ -^wiX^K^Xj -t)(Uj -U)dt. 
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A heuristic computation shows that 

which vanishes a.e. if and only if the null hypothesis of a constant coefficient of variation is satisfied. 
Consequently, a test for this hypothesis could be based on a Kolmogorov-Smirnov- or Cramer-von 
Mises statistic of the process H n . For a similar approach in the context of testing for a parametric 
form of the variance function we refer to Dette and Hetzler (2008). 

4.2 Example: Application to financial time series 

The hypothesis of the proportionality of the regression function m and the volatility function a 
can also be used to test for a multiplicative model structure. In particular, the proposed test 
can be viewed as a preliminary step in time series analysis before applying other procedures such 
as specific testing procedures for ARCH or GARCH models. One important criterion in order 
to establish all asymptotic results in such a context is assumption (M3). There the existence of 
bounds for the absolute moments of the regression function m, its second derivative m" and the 
variance function a 2 is required. But often financial time series do not satisfy this assumption. 
For instance, consider the linear ARCH(l) model 

for some constants ^Oj^i > 0,i?i < 1, where rj t has mean and variance 1 and is independent of 
Z t _i for all t. The squared ARCH(l) process can be written as 

(4.4) Z 2 t = (#„ + ® x Zl_ x ) + (# + tiiZf^et, 

where St = t] 2 — 1. Clearly, model ( 14.41) can be identified as a particular case of the general 
nonparametric regression model (11. ip by taking Y t = Z 2 , X t = Z 2 _ x , m(X t ) = i? + $iX t and 
<r(X t ) = c _1 (?9o + ^iAt). The scaling factor c is given by c 2 = (£^[^ 4 ] — 1) _1 and depends only on 
the error distribution. 

For the ARCH(l) process the assumption (M3) can therefore be formulated in terms of the bound- 
edness of absolute moments of Z t . So it is important to know whether the stationary solution Z t 
has moments of higher orders to apply the test. For stationary ARCH(p) processes with a sym- 
metric error distribution, a necessary and sufficient condition for the existence of such moments 
has been given by Milli0j (1985). In particular, let m > 0, then the mth moment of an ARCH(l) 
model exists if and only if E[$ir[o\ m < 1. As an immediate consequence, one sees that in many 
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cases ARCH processes do not have finite moments of higher orders. 

In such cases we refer to Remark 4.4. In order to circumvent the assumption of existing high-order 
moments of Z t we apply the (slightly) modified testing procedure. In particular, the identification 
of the regression function m and the variance function a 2 provides the assumptions (A2) and (A3) 
to be satisfied. Furthermore, from 

E[e k \X = x}= c k E[( V 2 - \) k \Z 2 t _^ = c k E[(r] 2 - l) k ] 

it follows that (e t ) fulfills (A7) and (M2) if the innovations r\ t satisfy certain moment conditions. 
If the assumptions (Al), (A4) - (A7), (A9) are satisfied and the ARCH(l) process (Z t ) fulfills the 
assumption (Ml) the asymptotic normality under the null hypothesis of a multiplicative model of 
the corresponding test statistic T*(c*) can be established, that is 

nV9(T:(c*) - E[t:(c*)}) 3 AT(0, (,$*), 

where the asymptotic variance (fify* is given by 



2\* 
OJ 



1152 J K 2 (u)du ^(^^- k ^E[Z 2k f(Z 2 )(w*) A (Z 2 )]. 



5 Finite sample properties 

In order to study the finite sample properties of the new test we have conducted a small simulation 
study. Because it is well known that the approximation of the nominal level by the normal 
distribution provided by Theorem 3.3 is not very accurate for moderate sample sizes, we do not 
recommend to estimate the asymptotic variance and bias and to compare the standardized statistic 
with the quantiles of a normal distribution. Instead, we propose to use resampling methods. As 
an example we have implemented a smooth bootstrap procedure to obtain the critical values. For 
this purpose we estimate the regression and variance function by the local linear estimates defined 
in (13.11) and (I3.3p . respectively, and consider the standardized residuals 

(5.1) rji = — — — — i = l,...,n, 

c(Ai) 

which are normalized to have mean and variance 1, that is 

(5.2) e t = , r,i ~ V i=l,...,n. 
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The bootstrap errors are then defined as 



(5.3) e* = e* + vN i , 

where £*,...,£* are drawn randomly with replacement from the empirical distribution of the 
standardized residuals ii,...,i n and Ni,...,N n are i.i.d standard normal distributed random 
variables independent of the sample y n = {(Xi,Yi), . . . , (X n ,Y n )} and v=v„ is a smoothing 
parameter converging to with increasing sample size. In the next step bootstrap data is generated 
according to the model 

(5.4) Y? = c&(X i ) + &(X i )e* i i = l,...,n, 

where c is the least squares estimate (I2.6P obtained from the data corresponding to the range 
[^(|o,05nj), ^(|o.95n|)] °f the predictors, where X^ < ... < X( n ) denotes the order statistic of 
Xi, . . . , X n . The test statistic T* is calculated from the bootstrap data (Xl, Y*), . . . , (X n , Y*). If 
B bootstrap replications have been performed, the null hypothesis fll.2j) is rejected if 

(5.5) T n > T*W-«)J) ) 

where T n < < T n denote the order statistic of the bootstrap sample. For the size of 
the bootstrap replications we chose B = 100, while 1000 simulation runs are performed for the 
calculation of the empirical level of this test. The sample sizes are given by n = 50, 100, 200 
and the smoothing parameters in the test statistic and the bootstrap procedure are chosen by 
g = n _1//2 and v = 0.1, respectively. The bandwidths for the estimation of the variance and 
regression function are chosen separately by least squares cross validation. 

Our first example considers the model 

(5.6) m{x) = c(l + 0.1a;) ; a(x) = (1 + O.lx), 

where c = 0.5, 1, 1.5. The predictors X±, . . . , X n are independent identically distributed following 
a uniform distribution on the interval [0,1], while the errors S\, . . . ,e n have a standard normal 
distribution. The first part of Table 1 shows the approximation of the nominal level, which is 
rather accurate for sample sizes larger than n = 100. For the smaller sample sizes n = 50 and 
larger values of c the approximation is not very accurate. In a second step we study the power of 
the test and consider the models 

(5.7) m(x) = c(l + O.lx) ; a(x) = (1 + O.lx + y/x) , 

(5.8) m(x) = c(l + O.lx) ; a(x) = (1 + O.lx + 2^/x~) . 
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The corresponding results are depicted in the lower part of Table 1. For the model (15.71) we observe 
a moderate increase in power, which corresponds to intuition. Because the predictor varies in the 
interval [0,1], the deviation from a multiplicative structure is extremely small for model (15.71) . On 
the other hand, the alternative model (15.81) is detected with larger power, which is also reflected 
by rather high simulated rejection probabilities. 



n 


50 


100 


200 




c \ a 


2.5 % 


5 % 


10 % 


20 % 


2.5 % 


5 % 


10 % 


20 % 


2.5 % 


5 % 


10 % 


20 % 




0.5 


.028 


.056 


.104 


.229 


.035 


.063 


.108 


.211 


.032 


.052 


.099 


.207 


1.0 


.042 


.051 


.105 


.210 


.031 


.049 


.099 


.194 


.033 


.054 


.104 


.204 


1.5 


.051 


.069 


.123 


.241 


.041 


.061 


.109 


.202 


.038 


.055 


.096 


.205 




0.5 


.043 


.067 


.144 


.267 


.051 


.074 


.148 


.269 


.122 


.187 


.281 


.442 


1.0 


.041 


.069 


.136 


.264 


.044 


.094 


.168 


.281 


.105 


.144 


.233 


.368 


1.5 


.064 


.091 


.141 


.247 


.077 


.109 


.166 


.273 


.114 


.152 


.208 


.341 


OEM 


0.5 


.049 


.092 


.167 


.283 


.066 


.097 


.213 


.368 


.097 


.155 


.261 


.421 


1.0 


.073 


.122 


.215 


.362 


.092 


.156 


.297 


.464 


.177 


.266 


.388 


.554 


1.5 


.063 


.107 


.203 


.378 


.123 


.187 


.281 


.442 


.233 


.316 


.424 


.559 



Table 1: Simulated rejection probabilities of the bootstrap test ( 15. 5\) . for three nonparametric re- 
gression models, where the first line corresponds to a multiplicative model. 

Our second example investigates the performance of the bootstrap test in the context of stationary 
time series. To this end we consider two models corresponding to the null hypothesis, that is 

(5.9) X t = (l + 0.1 X t _i) + (1 + 0.1 X t -i)£ t 

(5.10) X t = sin(l + 0.5 X t ^) + sin(l + 0.5 X t ^)e t 

and two models corresponding to the alternatives of no multiplicative structure, i.e. 

(5.11) X t = (l + 0.1 X t - X ) + 0.5V|X t _i| e t 

(5.12) X t = sin(l + 0.5 X t ^) + cos(l + 0.5 X tl )e t 

where the innovations are again independent standard normal distributed. The corresponding 
results are displayed in Table 2. We observe a reasonable approximation of the nominal level for 
the two models corresponding to the null hypothesis. On the other hand, the two alternatives in 
(15. lip and (15 . 12[) are detected with reasonable power. 
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n 



a 



50 



100 



200 



(jEIID 



2.5 % 
.029 
.038 
.053 
.084 



.047 
.057 
.077 
.117 



10 % 
.097 
.109 
.161 
.189 



20 % 
.217 
.201 
.295 
.299 



2.5 



.023 
.035 
.074 
.097 



.048 
.054 
.092 
.133 



10 % 
.089 
.092 
.182 
.212 



20 % 
.187 
.191 
.314 
.321 



2.5 



.024 
.036 
.113 
.129 



.048 
.057 
.156 
.176 



10 % 
.097 
.109 
.237 
.289 



20 % 
.189 
.205 
.395 
.417 



Table 2: Simulated rejection probabilities of the bootstrap test ( 15.5) ) for four nonparametric au- 
toregressive time series models. The models $5.9\) and $5.10\) correspond to the null hypothesis of 
a multiplicative model, while models $5.11\) and represent the alternative. 

6 Appendix: proofs 

6.1 Proof of Theorem 3.1. 

A straightforward calculation gives the decomposition 

(6.1) T n (c) = (c 2 + l) 2 T ln - 2(c 2 + l){2c 2 T 2n - T 3n (c)} + T 4n (c) - 4c 2 {T 5n (c) - c 2 T 6n }, 
with 



Tin 
Tin 

Ten 

T 3 n(c) 

T An (c) 

T5 n (c) 



1 


n 


'n — 


1) 




1 




n 


[n- 


1) 




1 




n 


[n- 


1) 




1 




n 


[n — 


1) 




1 




n 


[n — 


1) 




1 




n 


[n- 


1) 



l — J2 K g (Xi - X ] )m{X l )a{X l )w{X t )eMXMX ] )w{X ] )e 3 . i 
^— K g (X t - XjSniXiMXjAtiXj, e 3 )w{X 3 ), 



^— K 9 ( X i - Xj)A c (X t , £>(X,)m(X>(X>(X,)^-, 



where we have used the notation 



(6.2) 
(6.3) 



Sn(Xi) 



m 2 (X,)-cV(X i ) £ 2 
m 2 (X,)-m 2 (X 4 ). 
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At the end of the proof we will show that the terms T ln and T 2n are asymptotically negligible 
under the null hypothesis and under fixed alternatives, that is 

(6.4) n ^ Tjn ^0, J = 1,2. 

We now have to distinguish the case of the null hypothesis and alternative. 

Proof of Theorem 3.1(a). Note that the statistic T^ n {c) reduces under the null hypothesis to 

(6.5) T 3n (c) =° 1 V^(X t -X J )^(X t V(X t )m 2 (X J ) W (X J ){l-e J 2 }. 

n(n — 1) ^— i j j j 

with 

f« = — ±— ^^(X i -X i )m(X,)4(X>(X l )m 2 (X>(X i ){l- £ f} ) 
= - X J )^(X t ) W (X t )m 2 (X J ) W (X,){l - e 2 }, 

where we use the notation 

(6.6) S n (x) = m(x) — m(x). 

An application of Lemma 2 in Yao and Tong (2000) and a straightforward calculation [similar to 
the proof of the estimate (16. 4p given at the end of this section] yields 

(6.7) m = P (JL) , T2 = O p g) . 
Observing the conditions on the bandwidths we therefore have 



(6.8) 



T 3n (c) = o p ( — - 



We note again that this estimate holds only under the null hypothesis. 

We now obtain from (16.41) and (16.81) under the null hypothesis Hq : m(-) = c cr(-) that 



(6.9) 



T n (c) =° T 4n (c) - 4c 2 T 5n (c) + 4c 4 T 6n + o p (-^J 



and the assertion follows if the weak convergence can be established for the statistic 

(6.10) f n (c) = T 4w (c)-4c 2 T 5n (c)+4c 4 T 6n = 2 - V h n {V h V d ), 

n(n — 1 
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where V { = (JQ,£j), 

hij = K{V u V j ) = K g {X i -X j )n(V i )w{X i )n{V i )w(X j \ 
7r(Vi) = m 2 (X i ){l-e 2 -2ce i }. 

For a proof of weak convergence of T n (c) we note that the kernel h n (-, ■) is degenerate, i.e. 
(6.11) E[h n (V i ,V j )\V i ] = E[K(Vi,V-)\V;]=Q a.s. 

and apply Theorem 2.1 in de Jong (1987). For this purpose we calculate the variance as 
Var(T ra (c))= f_ E^ 

2 E [K 2 g (X t - X )m\X l )w\X i )m\X J )E[a\e % )\X l }E[a\e ] )\X J }w 2 {X )] 



n(n — 1) 

' f4 ( i 



+ o 



n(n — l)g \n 2 g t 
where we have used the notation 

(6.12) a{£i) = 1 - £• - 2csi 

2 

and (Iq is defined in (13.71) . Observing that by this calculation cr 2 - := E[h 2 j] = • (1 + o(l)) we 
therefore have 

which proves the first assumption in de Jong's (1987) Theorem 2.1. In order to establish the 
second assumption we derive the decomposition [see de Jong (1987)] 

(6.13) E[f*(c)] =G T + 6G n + 12G in + 2AG IV + 6G V , 



where 



G 



i 



n 4 ( 


>- 


I) 4 




2 4 






[n — 


I) 4 




2 4 




n A [ 


^ _ 


I) 4 




2 4 




n 4 ( 


[n — 


I) 4 




2 4 




n 4 ( 


'n — 


I) 4 



i<j 



kji ' 

i<j<k 



GlH = rtM-n - iH E E [h 2 ijh ki h k j] + E[h 2 k hjih jk ] + E[h 2 kj h i:j hi k ], 



i<j<k 

Giv = ~n TTj E] E[hijh ik hijhi k ] + E[hijhnh k jh k i] + E[h ik huhj k hji], 

i<j<k<l 
i<j<k<l 
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It is easy to see that [recall the notation of a(sj) in (16.121) ] 



E[h\ 2 ] = -i / / K A g (x 1 -x 2 )m 8 (x l )m 8 (x 2 )E[a\e 1 )\X 1 = x 1 ]E[a 4 (e 2 )\X 2 = x 2 }f(x 1 )f(x 2 ) 




w 4 (xi)w 4 (x 2 ) dxi dx 2 

1 ' 



which gives 



The other terms are estimated similarly, i.e. Gjj = O(^p-), Gm = O(^sp), Gjv = 0{-^-) and 
it follows from (I6.13P that 

(6.14) E[f 4 (c)] = QG v + o(^- 2 

On the other hand a straightforward calculation shows 

1 



nHn - l) 4 S (nv) ' « 4 (™ - x ) 4 ^ ° 
and we obtain 

1 



n 5 g 2 



a A n = 2G v + o 



A 9 / > 



which proves the second assumption in Theorem 2.1 of de Jong (1987), that is 

E[f±(c)] -3*i = o f4a 

Now this theorem yields asymptotic normality of n^fg T n (c) , i.e. n^fg T n (c) — > jV(0, /Xq), and the 
assertion of Theorem 3.1(a) follows from (I6.9P and (I6.10p . 

Proof of Theorem 3.1(b). We first note that under a fixed alternative the statistic T 3n {c) is 
not negligible. More precisely, we show at the end of the proof 

(6.15) T 3n (c) - E[T 3n (c)) = 2T 5n (c) + o p 

where 

E[T 3n (c)] = h 2 K 2 E[A c (X)m(X)m"(X)f(X)w 2 (X)} + o(h 2 ). 
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We obtain for the statistic T 6n in fl6.ll) 

2 

n 2 (n — 1) 



1 -E[m\x)a\x)f{x)w\x)] ( [ K 2 (u)du) + o( ' 



n(n — l)g \J J \n 2 g 

which gives 

(6.16) v^T 6n -^0 

(note that the expectation of T 6ra vanishes). A similar calculation yields 



Ti n (c) - 
(6.17) 

T 5n (c 



E[T in (c)\ = -Y,^c(X i ,e i )A c {X i )f{X i )w 2 {X i ) + oJ^ 



1 n i 
- V A c (X J )m(X i )a(X l )/(X> 2 (X t )^ + o p (^ 
i=i v 



n 

and E[T in (c)) = E\A 2 c (X)f(X)w 2 (X)} + 0(g 2 ). Consequently we obtain from fl6TT6l) 
and flBTTT] with 

£[T n (c)] = £[A 2 (X)/(X)«; 2 (X)] + h 2 B{c) + o(h 2 ) 
the following stochastic expansion under a fixed alternative 

(6.18) T n (c) - E[T n (c)] = (T 4n (c)-E[A 2 c (X)f(X)w 2 (X)])+4T 5n (c) + oJ-^ 



= Z n (c)+o p [—= 

where the random variable Z n (c) is defined by 

2 

n 



1 



2 n r 

Z n {c) = - J2\A c (Xi, e^A^XijfiX^iXi) - E[A 2 c (X)f(X)w 2 (X)) 
n l 
j=i 



(6.19) + 2m{X l )a{X l )A c {X i )f{X i )w 2 {X i )e, l 

and the assertion of Theorem 3.1(b) follows by a standard application of the central limit theorem 
verifying Ljapunoff's condition and observing that the dominating term on the right hand side of 

2 ( \ 

(JHUHJ) has expectation and variance 

The proof of Theorem 3.1 is now completed showing the remaining estimates (16.41) and (16. 15ft . 

Proof of the estimate ( 16. 4 j) . We consider exemplarily the case j = 1 (the other case is treated 
by similar arguments) and obtain the decomposition 

(6.20) T ln = 4^ + 4t£> + lfi> 
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with 



1 In ~ 


1 




n(n — 


1) 


T (2) 

1 In — 


1 




— 


1) 


T (3) _ 
1 In — 


1 




n(n — 


1) 



1 Y, K 9 ( X * - X J )m(X i )5 n (X 4 ) W (X,)m(X J )5 n (X J ) W (X J ), 



(6.22) iff = ( + K 2 l£ 2 > + ^T£ 3 ) ) {1 + o p (l)} 



[recall the definition of S n (x) in (16. 6p ]. The terms are all treated similarly and we consider 
again only the case £ — 1. With the notation 

fa m\ A t \ K h(X k -x) 
6-21 d k {x) = — 

and Lemma 2 in Yao and Tong (2000) it follows 

.2 

4 

with 

rfi- 15 = 3 ^ ^(X l -X J )4(X,)m(X i ) W (X l )d / (X,)m(X,)t i ;(X,)(T(X fe )£ fc( T(X i )^, 

lff 2) = ^ ^ X,(X - X J )4(X J )m(X i ) W (X,)a(X A; )£ fc m(X J )m ,/ (X,)«;(X J ), 

1,4 

Tt 3) = — — -^X 9 (X-X J )m(X)m"(X) W (X)m(X,)m"(X,)«;(X J ). 

71(71 1J . 

The terms iff , k = 1,2,3, can now be treated by calculating expectation and variance. For 
example, 

= ni ^_ 1} E l K s( X i ~ X i )4(X i )m(X l >(X i )4(X i )m(X,>(X,.) ( 7 2 (X fe )] + 
= ^ J K g (xi - Xj)K h (x k - Xi)K h (x k - Xj) 

y m ( nr. Am( t 1 W .( t ' Ami t Arr^fr, A f I TiArl/r-rlT .-dr.,. -I- O ( 

n 2 h? 

L 

V 2 



x m{x i )w{xi)m{xj)w{xj)a 2 {x} : )f{xk)dxidxjdxk + O 

\ I m 2 (x)a 2 (x)f(x)w 2 (x) dx f K 2 (w) dw + o\\ ] +o(^ 
nh J J \nh J \n z l 



< H U 
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For the calculation of the variance of T^' 1 "* we introduce the notation Vi = (Xi,Ei), 

tV(Vi, Vj, V k , Vi) = K g (X t - X i )4(X,)m(X i )t i ;(X i ) ( T(X fc )£ fc d / (X,)m(X J )^(X J )(T(X i )^ 
and obtain 

Var^) < E[{Tt lh2i 

1 



(6-23) = 1)2 E E ^Vi^V*,^) f&V^.^V. 



= o 



n 



l h 2 ' 



where we have used the fact, that the sum in (16.231) is dominated by those expectations, for which 
two indices of k,l,r,s coincide. Markov's inequality and the assumptions on the bandwidth now 
yield = Opi 71 " 1 ^ 1 ^ 2 )- The terms T^f^ and T 1 ^' 3 ' ) in f)6.22p are treated similarly, which 

implies = o p {n^ 1 g^ 1 ^ 2 ). A similar argument for the statistics T^j and yields assertion 
(16.4j) for j — 1. The second case j = 2 is treated in the same way. 

Proof of the estimate (16.151) . With the representation 

S n (x) = rh 2 (x) — m 2 (x) = 2m(x)5 n (x) + 6^{x) 
[recall the definition of S(x) in f)6.6p ] we have 
(6-24) T 3n (c) = 2T£ ) (c) + r£ ) (c) 

with 

T-£\c) = \ E^( X * -I>(I i )i(I>(I i )A c (I j ,£,>(I j ), 
n\n i) . 

A similar calculation as used in the proof of (16.41) yields 
(6-25) tM(c)=O p (±-)=o p ' ' 



nh I V v/n 



For the first term in (16.241) we introduce a further decomposition applying Lemma 2 in Yao and 
Tong (2000) and obtain 

(6-26) T«(c) = (t£- 1} (c) + |t£' 2) ) {1 + o p (l)} 
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with [recall the definition of c4(-) in (I6.2ip ] 

T^\c) = ^^if 5 (I I -I j )d i (%(I>(I i )a(I i ) £i A c (I j)£j >(I i ) 1 

T 3- 2) = „(?_ u E K *( X i ~ X j )m(X i )m"(X i )w(X i )A c (X j: e^,-). 

A straightforward but tedious calculation shows that Tg^'^fc) and T 5ri (c) are asymptotically equiv- 
alent, that is 

(6-27) E[(Tt 1 \c)-T 5n (c)) 2 } = o(-). 



n 



For the second term we obtain by a Taylor expansion 



(6.28) E[T^} = h 2 E[A c (X)m(X)m"(X)f(X)w 2 (X)} + o 



and 



Var(l£- 2) ) = o [ I 



which yields 

(6.29) V^(t£ 2) -E[t£ 2) ])=o p (1). 

Combining fl6\24j) - (16T29|) and noting that E[T 3n {c)} = n 2 E[T^ n 2 {c)] + o{h 2 ) now establishes flgJgP 
and completes the proof of Theorem 3.1. □ 



6.2 Proof of Theorem 3.2. 



Note that c 2 = ^ where 

^£2 



T 71 1 71 

i?i := - Vm 2 (X)f 2 (X>(X), i? 2 := - V(<T 2 (X i )) 2 W (X i ). 



n * — ' n 

i=l i=l 



We now apply Lemma 2 in Yao and Tong (2000) and Lemma 1 in Fan and Yao (1998), which 
give asymptotic representations of m(x) —m{x) and a 2 (x) —a 2 (x). A straightforward but tedious 
calculation yields 



(6.30) Ri-E[Rx) = -^{(m 2 {X i )a 2 {X i )w{X i )e 2 i -E[m 2 {X)(T 2 {X)w{X)] 

+ 2 m(X i )a 3 (X i )w;(X)£ i } + o p f-^=j , 



24 



1 n 

(6.31) R 2 -E[R 2 ] = -Y,{(°*(XiMXi) ~ E[o*(X)w(X)]) 

i=l 

+ 2 a 4 (X>(X,){£ 2 - 1}} + o p (-±=) 

with 

E[R!] = E[m 2 (X)a 2 (X)w(X)} + h 2 K 2 E[m(X)m"(X)a 2 (X)w(X)} + o(/i 2 ), 
E[R 2 ] = E[a 4 (X)w(X)] + h 2 K 2 E[a 2 (X)(a 2 (X))"w(X)] + o(h 2 ). 

With this representation and the notations r% := E[m 2 (X)a 2 (X)w(X)] and r 2 := £ , [cr 4 (X)tf (X)] 
it is easy to see that 

1 If R 2 -r 2 ( 1 \ / 1 



which implies 

(6.32) ' c 2 -cl = ^^- R ^ R \-^ + oJ^ 

r 2 ri \y/n 



where c 2 , is defined in (12.71) . Observing (16.301) we have 

Rx(R2-r 2 ) = E[R 1 ](R 2 -r 2 ) + (R 1 -E[R 1 ])(R 2 -r 2 ) 

= E[Ri](R 2 - E[R 2 \) + EIR^EIR^ - r 2 ) + O p (-^=) 

= n(R 2 - E[R 2 }) + E[Ri\(E[R 2 \ - r 2 ) + O p (-^=), 

and we obtain 

, 2 _ 2 Ri-EjR^ EjR^-n ri (R 2 -E[R 2 ]) EjR^EjR,} - r 2 ) 

C Cn — + „ 9 + 0„ 

r 2 r 2 rf r% \yjn 

= h 2 T + o(h 2 ) + Z n + o p (-^= 

where the random variable Z n is defined as 



Z n = -^{^(^(X^iX^wiX^e 2 - E[m 2 (X)a 2 (X)w(X)]) + 2r 1 m(X,)a 3 (X 4 ) W (X J )^ 

(6.33) - r 2 ^(X t )w(X t ) - E[a\X)w{X)]) - 2r 2 a 4 (X t ) W (X i ){e 2 - 1}}. 

The assertion finally follows from the central limit theorem and a straightforward but tedious 
calculation of the variance nVar(Z n ) = v 2 . □ 
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6.3 Proof of Theorem 3.3. 



(6.34) T n {c)-T n {c)=o p 



The assertion of the theorem follows, if the estimate 

1 

can be established. For this purpose we introduce the analogue of the decomposition ( 16.11) . and 
obtain 

(6.35) T n (c) = (c 2 + l) 2 T ln -4c 2 (c 2 + l)T 2n + 2(c 2 + l)^ 



2(c 2 + l)T 3n (c) + T 4n (c) - 4c 2 T 5n (c) + 4c 4 T 6n + oJ— 

\ Ti\/ 



where the last equality follows from (16.41) and Theorem 3.2. For the third term in (16.351) we have 

Tz n (c)=T 3n (c )-{c*-cl)T£\ 

where 

T ^ = rTr^TV - ^(x>(x^ 2 (x>(x^ 2 = o p (h 2 ) + o p QA 

[this estimate can be shown by similar arguments as presented in the proof of the estimate (16.41) 
Consequently we obtain from Theorem 3.2 

(6.36) T 3n (c) - T 3n (c ) = oJ-^— 

\ny/g 

The corresponding estimates 

(6.37) T 4n (c) - T 4n (c ) = o, ' 



P\ r- 

(6.38) c 2 T 5n (c) = clT 5n (c ) + o p ( — 

\n,, 

(6.39) (c 2 ) 2 T 6 „ = c 4 T 6n + O p (. 



h 2 



are proved by similar arguments, which are not given here for the sake of brevity. Note that the 
estimates f!6.38|) and (16.391) hold also under a fixed alternative. Recalling 

2 H 2 

c = c 

and combining (16.351) - (I6.38P with Theorem 3.2 yields 

(6.40) T n (c) =° T 4n (c) - 4c 2 T 5n (c) + 4c 4 T 6n + = T n (c) + o P {^=) 

where the last equality follows from the proof of Theorem 3.1(a). This proves the first identity in 
Theorem 3.3, while the remaining part follows again from Theorem 3.1. □ 
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6.4 Proof of Theorem 3.4. 

Recalling the decomposition (16.351) and (16.381) . (I6.39|) we obtain 
(6.41) T n (c) = 2(c 2 + l)T 3n (c)+T in (c)-4c 2 T 5n (c )+44T 6n + o. 



1 



p /—it 



and it remains to investigate the asymptotic properties of T 3n (c) and T 4n (c) under the fixed 
alternative. For this we note that (16.361) and the obvious estimate T 3n (co) = o p {l) yield 

(6.42) (c 2 + l)T 3n (c) - (c 2 + l)E[T 3n (c )} = 2(c 2 + l)T 5n (c ) + o p (-^j. 
The corresponding estimate for the term T 4n (c) is more difficult. Note first that 

(6.43) T 4n (c) = T 4n (c ) - 2 (c 2 - c 2 ) T«(o) + (c 2 - c 2 ) 2 T 4 ?, 
where 

if U — 1— £ - X> 2 (X>(X^ 2 a 2 (X>(X,04 

77^77 1J . 

A standard calculation yields = O p (l), which shows that the second term in (16.431) is of order 
o p (-5=). From Theorem 3.2. it follows 

(c 2 - cl)T^(co) = e(c 2 - E[c 2 }) + h 2 TE[A CQ {X)a 2 {X)f{X)w 2 {X)] + o{h 2 ) + o p (-^), 
where A C0 (X) is defined in (12.31) . Combining this with (16.431) therefore yields 

T n {c) - E[T n {c)} = T An (c )-E[n n (c )]+4:T 5n (co)-2g(c 2 -E[c 2 ]) + o p ^^ 
= Z n (c ) -2gZ n + o p (^=j, 

where the random variables Z n (co) and Z n are defined in (I6.19P and (16.331) . respectively, and 

E[T n (c)] = E[A 2 jX)f(X)w 2 (X)] + h 2 (B(c ) - 2qT) + o{h 2 ). 

The variances of Z n (co) and Z n have been determined in the proof of Theorem 3.1 and 3.2 and a 
straightforward calculation gives nCov(Z„(co), Z n ) = v 2 (co), where v 2 (co) is defined in Theorem 
3.4. The assertion now follows from a standard application of the central limit theorem. □ 
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